Towards Machine-Actionable Modules of a Digital Mathematics Library - The Example of DML-CZ
نویسندگان
چکیده
Publishing and archiving mathematical literature presents its own sets of problems. Reaching the goal of building global digital mathematics library (DML), smaller DMLs play an inevitable role in collecting, validating, digitizing and checking data from smaller publishers. In this paper, we overview the technical challenges of building a machineactionable set of modules we have developed over almost a decade of evolution of the Czech Digital Mathematics Library (DML-CZ). Firstly, we survey methods of effective automated data acquisition from the content providers. Then we show OCR processing of mathematical documents and automated segmentation of plain text references for metadata enhancement and effective DOI look up. Finally we describe connection to the European Digital Mathematics Library (EuDML) project and public interfaces of DML-CZ for the best visibility and accessibility.
منابع مشابه
From Pixels and Minds to the Mathematical Knowledge in a Digital Library
Experience in setting up a workflow from scanned images of mathematical papers into a fully fledged mathematical library is described on the example of the project Czech Digital Mathematics Library DML-CZ. An overview of the whole process is given, with description of all main production steps. DML-CZ has recently been launched to public with more than 100,000 digitized pages.
متن کاملAutomated Processing of TEX-Typeset Articles for a Digital Library
Experience in setting up a comprehensive journal processing system based on the TEX typesetting system with the CEDRAMworkflow is described, following the example of the Archivum Mathematicum journal. The system automates the preparation of issues and simultaneously generates the materials needed for the Czech Digital Mathematics Library project (DML-CZ). The second part of the article describe...
متن کاملDigitization Workflow in the Czech Digital Mathematics Library
Experience in setting up a workflow from scanned images of mathematical writings into a fully fledged mathematical library is described on the example of the project Czech Digital Mathematics Library DML-CZ. An overview of the whole process is given, with detailed description of production steps involving scanned image processing and optical character recognition. Experience gained, lessons lea...
متن کاملAn Experience with Building Digital Open Access Repository DML-CZ
A succesfully built institutional or community repository (e.g. set of workflows) needs a coordinated effort of librarians, IT specialists and representatives of users – content specialists. We will explain and discuss design, technical a political decisions behind building the Czech Digital Mathematics Library DML-CZ (http://dml.cz) in the context of other succesfull thematical community proje...
متن کاملBuilding the Czech Digital Mathematics Library upon DSpace System
The paper describes the process of building the Czech Digital Mathematics Library (DML-CZ) upon DSpace System. At first, the DML-CZ will be briefly introduced. Then we will describe DSpace system and its architecture together with Manakin—a system for building user interface above DSpace. The first technical part of the paper will be about mapping DML-CZ structure onto DSpace structures and abo...
متن کامل